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I. INTRODUCTION 

Semileptonic decays of B mesons to the r lepton — 
the heaviest of the three charged leptons — provide a 
new source of information on Standard Model (SM) pro- 
cesses [HJl,Hj_as well as a new window on physics beyond 
the SM PTE BBS, II- In the SM , semileptonic decays 
occur at tree level and are mediated by the W boson, but 
the large mass of the r lepton provides sensitivity to addi- 
tional amplitudes, such as those mediated by a charged 
Higgs boson. Experimentally, b — » ct~V t decays 1 are 
challenging to study because the final state contains not 
just one, but two or three neutrinos as a result of the t 
decay. 

Theoretical predictions for semileptonic decays to ex- 
clusive final states require knowledge of the form factors, 
which parametrize the hadronic current as functions of 
Q 2 = [pb — Pdm} 2 - For light leptons I = e, fi 2 there is ef- 
fectively one form factor for B — > Di^Vg, while there are 
three for B — > D*£~Vg. If a r lepton is produced instead, 
one additional form factor enters in each mode. The form 
factors for B — > D^l~Vf_ decays 3 involving the light 
leptons have been measured 1(| HH, E3] , providing direct 
information on four of the six form factors. Heavy quark 
symmetry (HQS) relations [H| allow one to express the 
two additional form factors for B — > £H*V~I7 T in terms 
of the form factors measurable from decays with the light 
leptons. With sufficient data, one could probe the addi- 
tional form factors and test the HQS relations. 

Branching fractions for semileptonic B decays to r lep- 
tons are predicted to be smaller than those to light lep- 
tons. Calculations based on the SM predict B(B° — > 
D+t-V t ) = (0.69 ± 0.04)% and B(B° -> D*+t-V t ) = 
(1.41±0.07)% 0, which account for most of the predicted 
inclusive rate B(B -> X cT -V T ) = (2.30±0.25)% [2] (here, 
X c represents all hadronic final states from the b — > c 
transition) . In multi-Higgs doublet models 0, H, H, 0, Hi , 
substantial departures, cither positive or negative, from 
the SM decay rate could occur for B(B — > Dt~V t ), while 
smaller departures are expected for B(B — ► D*t~V t ). 
Thus, measurements of B(B — > Dt~V t ) are more sen- 
sitive to non-SM contributions than either B(B — > 
D*t~V t ) or the inclusive rate. In addition to the branch- 
ing fractions, several other observables are sensitive to 
possible non-SM contributions, including q 2 distributions 
and D* and r polarization 0, i, i, H, 03] • 

The first measurements of semileptonic &-hadron de- 
cays to r leptons were performed by the LEP exper- 
iments [HI operating at the Z° resonance, yielding 
an average [l6( inclusive branching fraction *B(&had — > 



Xt~v t ) = (2.48±0.26)%, where 6 n ad represents the mix- 
ture of 6-hadrons produced in Z° — > bb decays. The 
Belle experiment has reported B(B° — > D* + t~V t ) = 
(2.02+°;«±0.37)% [13 • 

The BABAR Collaboration has presented a measure- 
ment of the branching fractions for B — > Dt~V t 
and B — > D*t~V t for both charged and neutral B 
mesons [18| . In this article, we describe the analysis in 
greater detail, with particular emphasis on several novel 
features of the event selection and fit technique. We also 
present distributions of two important kinematic vari- 
ables, the lepton momentum, |p||, and the squared mo- 
mentum transfer, q 2 . 



A. Analysis overview and strategy 

We determine the branching fractions of four exclusive 
decay modes: £T -> D q t-V t , B~ -> D*°T~V r , B° -> 
D + t~v t , and B° — > D* + t~v Ti each of which is mea- 
sured as a branching-fraction ratio R relative to the cor- 
responding e and /x modes. To reconstruct the r, we use 
the decays r~ — > e~v e v T and t~ — ► pTV^Pr, which are 
experimentally the most accessible. The main challenge 
of the measurement is to distinguish B — > D(*'t~V t de- 
cays, which have three neutrinos, from B — > D^t~Vi 
decays, which have the same observable final-state parti- 
cles but only one neutrino. 

The analysis strategy is to reconstruct the decays of 
both B mesons in the T(45) — > BB event, providing 
powerful constraints on unobserved particles. One B 
meson, denoted B tagl is fully reconstructed in a purely 
hadronic decay chain. The remaining charged particles 
and photons are required to be consistent with the prod- 
ucts of a b — > c semileptonic B decay: the daughter charm 
meson (either a D or D*) and a lepton (e or fx). The lep- 
ton may be either primary or from t~ — > l~Vgv r . To dis- 
tinguish signal events from the normalization modes B — > 
D(*)£~Ve, we calculate the missing four-momentum, 



Pn 



Pe+e- -Ptag ~ PdM ~ Pi 



(1) 



P 2 ■ 
L miss 



1 Charge-conjugate modes are implied throughout. 

2 Throughout this article, we use the symbol f. to refer only to the 
light charged leptons e and fi. 

3 The symbol Z?(*' refers either to a D or a D* meson. 



of any particles recoiling against the observed B tag 
D^£ system. A large peak at zero in m 2 liss 
corresponds to semileptonic decays with one neutrino, 
whereas signal events produce a broad tail out to ~ 
8 (GeV/c 2 ) 2 . 

To separate signal and background events, we perform 
a fit (described in Section lVlip to the joint distribution of 
TO miss ano - the lepton momentum (|p||) in the rest frame 
of the B meson. In signal events, the observed lepton 
is the daughter of the t and typically has a soft spec- 
trum; for most background events, this lepton typically 
has higher momentum. The fit is performed simultane- 
ously in eight channels, with a set of constraints relat- 
ing the event yields between the channels. The fit is 
designed to maximize the sensitivity to the B — > Dt~V t 
signals by using events in the D*l~ channels to constrain 
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the dominant backgrounds, B — > D*t~V t feed-down, in 
which the final-state D* meson is not completely recon- 
structed. Similarly, we use a set of D** control samples to 
constrain the feed-down background to both the Dt~V t 
and D*t~v t signals. 4 

We perform a relative measurement, extracting both 
signal B — ► D^t~V t and normalization B — > D^*'l~Vi 
yields from the fit to obtain the four branching-fraction 
ratios R(D°), R(D+), R(D*°), and R{D*+), where, 
for example, R(D*°) = B(B~ -> D*°t-V t )/B(B- -> 
D*°£-V e ). In the ratio, many systematic uncertainties 
cancel, either partially or completely. These ratios are 
normalized such that i represents only one of e or /i; 
however, both light lepton species are included in the 
measurement. We multiply these branching- fraction ra- 
tios by previous measurements of B(B — >• D(*>l~Vi) to 
derive absolute branching fractions. 



II. THE BABAR DETECTOR AND DATA SETS 

We analyze data collected with the BABAR detector at 
the PEP-II e + e~ storage rings at the Stanford Linear Ac- 
celerator Center. PEP-II is an asymmetric-energy B fac- 
tory, colliding 9.0 GeV e~ with 3.1 GeV e + at a center- 
of-mass energy of 10.58 GeV, corresponding to the T(4S) 
resonance. The data sample used consists of 208.9 fb _1 
of integrated luminosity recorded on the T(AS) resonance 
between 1999 and 2004, yielding 232 x 10 6 T(45) -> BB 
decays. This data sample can be divided into two ma- 
jor periods: Runs 1-3, comprising 109.0 fb _1 taken from 
1999 to June 2003, and Run 4, comprising 99.9 fb _1 
taken from September 2003 to July 2004. The accel- 
erator background conditions were significantly different 
between Runs 1-3 and Run 4, which could affect missing- 
energy analyses such as this one; for this reason, the two 
running periods have been independently validated, and 
the fraction of signal-like events found in the Run 4 sam- 
ple is used as a crosscheck of the results, as described in 
Section H 

The BABAR detector is a large, general-purpose mag- 
netic spectrometer and is described in detail else- 
where [19] . Charged particle trajectories are measured in 
a tracking system consisting of a five-layer double-sided 
silicon strip detector and a 40-layer drift chamber, both 
of which operate in the 1.5 T magnetic field of a super- 
conducting solenoid. A detector of internally reflected 
Cherenkov light (DIRC) is used to measure charged par- 
ticle velocity for particle identification (PID). An electro- 
magnetic calorimeter (EMC), consisting of 6580 CsI(Tl) 
crystals, is used to reconstruct photons and in electron 
identification. The steel flux return of the solenoid is seg- 



4 Throught this paper, we use the symbol D** to represent all 
charm resonances heavier than the D*(2010), as well as non- 
resonant _D'*'mr systems with n > 1. 



mented and instrumented with resistive plate chambers 
(IFR) for muon and neutral hadron identification. 

All detector systems contribute to charged particle 
identification. Ionization energy losses in the tracking 
systems and the Cherenkov light signature in the DIRC 
are used for all charged particle types. Electrons are also 
identified on the basis of shower shape in the EMC and 
the ratio of energy deposited in the EMC to the track 
momentum. Muon identification is based on a minimum- 
ionization energy deposit in the EMC and on the mea- 
sured interaction length in the IFR. 

This analysis relies on measurement of the missing mo- 
mentum carried off by multiple neutrinos, and the large 
solid angle coverage (hermeticity) of the detector is there- 
fore crucial. The tracking system, calorimeter, and IFR 
cover the full azimuthal range and the polar angle range 
from approximately 0.3 < 8 < 2.7 rad in the laboratory 
frame, corresponding to a T(AS) center-of-mass coverage 
of approximately 90% (the direction 9 = corresponds to 
the direction of the high-energy beam, and therefore to 
the T(45) boost). The DIRC fiducial volume is slightly 
smaller, corresponding to a center-of-mass frame cover- 
age of about 84%. 

Within the active detector volume, the efficiency for 
reconstructing charged tracks and photons is very high, 
typically greater than 95% over most of the momentum 
range. At low momenta, however, the reconstruction ef- 
ficiency drops off, leading to an increased contribution 
from feed-down processes to which special attention is 
paid throughout this analysis. Feed-down occurs when 
the photon from D* — > or the 7r° from D* — > Dtt° 
is not reconstructed (in the case of the 7r°, either one or 
both of the photons from tt° — > 77 may be missed) . Care 
must therefore be taken to avoid confusing D* feed-down 
events for D signals. 

We use a Monte Carlo simulation (MC) of the produc- 
tion and decay of signal and background events based 
on EvtGen [2fJ. A sample of simulated inclusive BB 
events equivalent to about five times the integrated lu- 
minosity is used to study backgrounds and to optimize 
event selection criteria. Large samples of many individual 
semileptonic B decays (discussed in Section ITO|) are used 
to parameterize the distributions of variables used in the 
fit. Final-state radiation is simulated using PHOTOS (2lj . 
Simulation of the detector response is performed with 
GEANT [22j and the resulting efficiencies and resolutions 
are validated in multiple data control samples. 

III. SEMILEPTONIC DECAY MODELS 

In the SM, the matrix element for a semileptonic B 
meson decay can be written as 

2 

M (B -> flW(r/r-)p) = -i-V^I% , (2) 
where g is the weak coupling constant, mw the W mass, 
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V c b the quark mixing matrix element, and L M and are 
the leptonic and hadronic currents, respectively. Here, 
we have used a simplified form for the W propagator ap- 
propriate for energies much less than m\y. The leptonic 
current is exactly known, 

L" = UiYXl -75K, (3) 
and the hadronic current is given by 

H^ = (D^\c^(l- l5 )b\B) . (4) 

In the case of a B — > D transition, the axial-vector 
part of the current does not contribute to the decay, and 
we may write the hadronic current in terms of two form 
factors f+(q 2 ) and f~{q 2 ): 

(D\V»\B) = (p + pYf + (q 2 ) + (p-p'Tf-(q 2 ) , (5) 

with = cj^b and where p and p 1 are the four-momenta 
of the B and D mesons, respectively. For the B — > D* 
transition, the axial-vector term contributes to the decay 
as well, and we write the hadronic current in terms of 
form factors V(q 2 ), Ax(q 2 ), A 2 (q 2 ), A 3 (q 2 ), and A (q 2 ): 



0.18 [28| for B — > D£ vi and B Dt v r decays, and we 
use R\ = 1.417±0.061±0.044, Ro = 0.836±0.037±0.022, 
and p 2 Ai = 1.179±0.048±0.028 HH for B -> D*l~Vt and 
B — > D*t~V t decays. 5 Variation of these form factors is 
taken into account as a systematic uncertainty, including 
the correlations between the three B — > D* form factor 
parameters. 

Figures HH3] show distributions of three kinematic vari- 
ables important to this analysis, all generated using the 
CLN form factor parameterization with parameters given 
above. Figure [T] compares q 2 distributions between the 
signal and normalization modes. Signal events must sat- 
isfy q 2 > m 2 , leading to qualitatively different q 2 spectra 
for signal and normalization events; this feature is ex- 
ploited in the event selection and in validation studies. 
Figure [2] shows distributions of lepton energy in the B 
meson rest frame. While the t~ lepton in signal events 
typically has high energy (due to its mass), the secondary 
lepton £~ typically has much lower energy than either the 
r~ or the primary lepton in B — > D^£~Vg events. This 
low lepton energy leads to a lower reconstruction effi- 
ciency for signal leptons than those in the normalization 
modes. Figure [3] shows distributions of for the two 

signal modes, which, due to the three neutrinos in these 
events, forms a broad structure up to very large TO^ iss . 



(D*\V»-A»\B) = ———elp' a p V(q 2 )- 



(m B +m D «)e*»A 1 {q 2 ) + 



TUB + WD* 

e*-q 



to b + m,D* 



-(p + P YA 2 (q 2 ) + 



2 mD , L_l q »A 3 ( q 2 ) - 2m D , £ —l q n Ao (q 2 ) , (6) 



q 



where A^ = 07^756 and e is the D* polarization vec- 
tor. The form factor A 3 (q 2 ) is related to two other form 
factors as 



Mq 2 ) 



2m 1 



-A^q 2 )- 



2m 1 



-A 2 (q 2 ) (7) 



so that there are only four independent form factors. 

In the limit of massless leptons, any terms proportional 
to <7 M = {p—p')^ vanish when the hadronic current is con- 
tracted with the leptonic current. For this reason, the 
contributions from the form factors f-(q 2 ) and Ao(q 2 ) 
are essentially negligible for electrons and muons, as men- 
tioned above. 

Semileptonic decays are simulated using the ISGW2 
model [23], except for B — > D*l~T>i decays, which use an 
HQET model with a linear form factor expansion [2^ |. 
and nonresonant B — > D^tt£~v£ decays, which use the 
model of Goity and Roberts [25[. We reweight both sig- 
nal B — * D(*'t~V t and normalization B — > D^*'l~Vi 
events (2(| so that the decay distributions follow the 
Caprini-Lellouch-Neubert (CLN) form factor model [27j 
with parameters measured in data. We use p 2 ^ — 1.17 ± 



IV. EVENT RECONSTRUCTION AND 
SELECTION 

All event selection requirements (as well as the fit pro- 
cedure described in Section [VII[) are defined using simu- 
lated events or using control samples in data that exclude 
the signal region in order to avoid any potential sources 
of bias. About 60% of the BB MC sample is used in 
optimizing the event selection, while the remaining 40% 
is used as an independent validation of the selection and 
fitting procedures. 

Most of the selection criteria described here are opti- 
mized to maximize the quantity S/ \/~S + B, where S and 
B are the expected signal and background yields in the 
large TO^ iss region of our data sample, assuming Stan- 
dard Model branching fractions for signal decays. The 
requirement on AE of the -Btag candidate (defined below) 
was initially optimized in the same way, but was tight- 
ened because fits to MC samples indicated that events 
at large |AS| contributed to biases in the signal extrac- 
tion. The final selection corresponds to a compromise 



5 The parameters Ri and R2 are not included in the model of 
Caprini, Lellouch, and Neubert |27| ; to model the B — > D* form 
factors, we adopt the formalism used in [l2ll . Eqs. (13-14), where 
the leading terms in these form factor ratio expansions are taken 
as free parameters. We use independent slope parameters 
and for the B — » D and B — » D* form factors, respectively, 
treating the two sets of form factors as uncorrelated. 
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FIG. 1: Generated q 2 distributions for (a) B — > Di? and 
B -> Dt~V t ; (b) B -> D*£~Vt and B -> D*t~V t . The two 
curves in each plot show q 2 for the light lepton (dashed) and 
for the t (solid). All distributions use the CLN form factor 
model with experimentally-measured shape parameters. The 
distributions are normalized to equal areas. 



between the statistical S/y S + B optimization and the 
systematic effects due to this bias. 



FIG. 2: Generated lepton energy distributions for (a) B — > 
Drv e and B -> Dt~V t ; (b) B -> D*£~V e and B -» D*t~V t . 
The three curves in each plot show the £ _ energy in B — » 
D^rVe (dashed), the r" energy in B -> D^t~V t (solid), 
and the secondary lepton energy in B — > D^*'t~V t (dotted), 
all defined in the B meson rest frame. All distributions use the 
CLN form factor model with experimentally-measured shape 
parameters. The distributions are normalized to equal areas. 



isfy PID criteria (loose criteria for D° — > K ir + , tight 



B tag Reconstruction 



D\J g candidates are selected within 



for all other modes. 6 ) 

about 2cr (standard deviations) of the nominal mass, with 
a depending on the reconstruction mode and typically 
We reconstruct B tag candidates in 1114 final states 5_io MeV/c 2 for the D tag mass and 1-2 MeV/c 2 for the 

tag 



D^Y ± with an algorithm that has been used £)* - D t mass difference 



B 

previously at BABAR for a number of analyses, espec ially 
those dependent on measuring missing momentum [29j. 
These final states arise from the large number of ways 
to reconstruct the D and D* mesons within the Btag 
candidate and the possible pion and kaon combinations 
within the system. Tag-side D candidates are re- 



tag — -^tag 

We use two kinematic variables to identify -Btag candi- 
dates, 



constructed as D{? 



K~ 



K~ 



and as D^ ag 



and K®n + ii 

dates are reconstructed as D^ g 



and K%+tt°. Tag-side D* candi- 
D° tag n° and D° ag7 

and as D£j — > D® ag n + . The Y^ system may consist of 
up to six light hadrons (tt^, tt°, K^, or Kg). In both 
the -Dtag an< ^ ^ ± systems, we reconstruct 7r° — > 77 and 
Kg — > tt + it~ and require charged kaon candidates to sat- 



iPtag 



(8) 



6 The terms "loose" and "tight" refer to the relative signal-to- 
background discrimination of various PID criteria. Loose crite- 
ria are chosen to have high efficiency, and have relatively high 
background rates as well; tight criteria have lower background 
but also glower signal efficiency. The optimal choice of criteria 
depends on the particle type and on the a priori purity of the 
sample, and is therefore different for each reconstruction channel. 
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FIG. 3: Generated m^iss distributions for (a) B — » Dt~V t 
and (b) B — » D*t~V t . Both distributions use the CLN form 
factor model with experimentally-measured shape parame- 
ters. 



and 



AE = £ t ag - v^/2 



(9) 



where ^J~s is the total e + e~ energy, |ptag| is the magnitude 
of the Stag momentum, and E tag is the Stag energy, all 
defined in the e + e~ center-of-mass frame. For correctly 
reconstructed B tag candidates, tjjes is equal to the B 
meson mass, with a resolution of about 2.5 MeV/c 2 , and 
AE is equal to zero, with a resolution of about 18 MeV. 

For each D\J g "seed" candidate, we use a recursive al- 
gorithm to identify candidate Y^ systems. Light hadrons 
from the remaining tracks and photons in the event are 
added to the system, one at a time. If the result- 
ing values of toes and AE for the D^gF candidate are 
close to the nominal values, the -B tag candidate is ac- 
cepted. If the value of AE is too large, the light hadron 
just added is removed from the Y system, since contin- 
uing to add particles to this Y ± candidate will increase 
AE further. The algorithm then continues recursively 
with the remaining particles in the event, adding and 
removing light hadrons to the Y system according to 
toes, AE, and the Y system topology. This algorithm 
is semiexclusive, meaning that particles in the Y sys- 
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FIG. 4: Distributions of ities in selected events, (a) shows 
all -Btag candidates reconstructed in 20 fb _1 of data, and the 
purity of this plot has been increased by requiring \AE\ < 
50 MeV. (b) shows the distribution for the complete data 
sample after the signal B and total-event selection require- 
ments. Note the substantial improvement in purity due to 
the complete reconstruction. 



tern are not constrained to intermediate resonance states. 
Because of this, the yield is significantly higher than ex- 
clusive B reconstruction, while the purity is somewhat 
lower. In this analysis, however, since we exclusively re- 
construct the second B meson in the event, the purity of 
our final sample is substantially improved with respect 
to the raw Btag sample. 

We require m ES > 5.27 GeV/c 2 and \AE\ < 72 MeV, 
corresponding to ±4er in AE and — 4tr in toes (the kine- 
matic limit toes < provides an effective +4cr re- 
quirement). We reconstruct Stag candidates with an ef- 
ficiency of 0.2% to 0.3%. Figure 2] shows distributions of 
Toes for selected B tag candidates both before and after 
the signal-side reconstruction. We make no attempt at 
this stage to select a single -Btag among multiple recon- 
structed candidates: this decision is made after recon- 
structing the signal side as well. 
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B. Reconstruction of the Signal B 

For the B meson decaying semileptonically, we re- 
construct £)W candidates in the modes D° — > K~tt + , 

K-JT+TT , K-TT+TT+TT-, K° S TT+ ^ ] D+ -> K-^+TT+, 

K-tt+tt+tt , K° s tt+, K~K+tt + ; D*° -> D°tt°, Z?°7; and 
D* + — > Z? 7r+, Z5+7T . We reconstruct mesons as 
iT° -> 7T+7T- with 491 < m^- < 506MeV/c 2 , corre- 
sponding to ±3cr. We reconstruct ir° mesons as 7r° — > 77, 
requiring 90 < m 77 < 165MeV/c 2 for the soft tt° used to 
reconstruct D* —* Dir^ o{t , and requiring E 11 > 200 MeV 
and 125 < m 77 < 145 MeV/ c 2 for a 7r° used to recon- 
struct a -D meson; the mass intervals correspond to ±3a 
in both cases and are different because the resolution 
is poorer at low energies. Charged kaon candidates are 
required to satisfy tight PID criteria with a typical ef- 
ficiency of 85% while rejecting 98% of pions. Charged 
pion candidates are required to satisfy loose PID criteria 
with a typical efficiency of 97% while rejecting 88% of 
kaons. D (D*) candidates are selected within 4cr of the 
D mass (D* — D mass difference); as on the tag side, a 
is typically 5-10 MeV/c 2 (1-2 MeV/c 2 ). 

Electron candidates are required to satisfy tight PID 
criteria and to have lab-frame momentum |p e | > 
300MeV/c, with an efficiency that rises from 85% at 
the lowest momenta to 95% for |p e | > 1.0 GeV/c. 
Muon candidates are required to satisfy tight PID cri- 
teria; since muon PID relies on the hit pattern in the 
IFR, this effectively requires |p M | gl 600MeV/c, and re- 
sults in an efficiency of 40%-60% over the allowed mo- 
mentum range. The energy of electron candidates is 
corrected for bremsstrahlung energy loss if photons are 
found close to the electron direction. Lepton candidates 
of either flavor are required to have at least 12 hits in 
the drift chamber and to have a laboratory-frame po- 
lar angle 0.4 < 9 < 2.6 rad (excluding the very forward 
and very backward regions of the tracking system) in 
order to ensure a well-measured momentum, since mis- 
measured lepton momenta distort the n^iss distribution 
and tend to move background events into the signal-like 
region. Approximately 5% of selected lepton candidates 
are misidentified, almost all of which are pions misrecon- 
structed as muons. 



C. Total-Event and Single-Candidate Selection 

We form whole-event candidates by combining J5 tag 
candidates with D^*H~ candidate systems. We combine 
charged Stag candidates with D^*'°£~ systems and neu- 
tral B tag candidates with both D^ + (r and D^~£ + sys- 
tems, where the inclusion of both charge combinations 
allows for neutral B mixing. 

In correctly reconstructed signal and normalization 
events, all of the stable final-state particles, with the ex- 
ception of the neutrinos, are associated with either the 
-Btagi D(*\ or £~ candidate. Events with additional par- 
ticles in the final state must therefore have been mis- 
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FIG. 5: Distributions of -Eextra in simulated events, shown 
after all event selection except the cut on -E oxtr a itself. Sig- 
nal B — » Z)'*'t~77 t events are shown as points, while B — > 
D^*'l~V£ normalization events are shown as the histogram. 
The gap between the -E GX tra = bin and the remainder of the 
distribution corresponds to the minimum allowed photon en- 
ergy, 50 MeV. The normalization is arbitrary. The agreement 
between the two distributions indicates that the efficiency of 
a cut on -E cx tra will cancel when we measure the branching- 
fraction ratio. 



reconstructed, and we suppress these backgrounds with 
two selection requirements on the "extra" particle con- 
tent in the event. We require that all observed charged 
tracks be associated with either the Stag, D^*\ or £ can- 
didate. We compute -E extra , the sum of the energies of all 
photon candidates not associated with the Btag + D^t 
candidate system, and we require -E e xtra < 150-300 MeV, 
depending on the channel. When considering these 
extra tracks and extra photons, care is taken to reject 
track and photon candidates which are likely to be due 
to accelerator background, electronics noise, or recon- 
struction software failures; fake photons in the EMC are, 
to some degree, unavoidable, which is why we can not 
simply require E ex tr& — 0. The different D modes have 
very different levels of combinatorial background, which 
the -Eextra cut is particularly effective at rejecting. Fig- 
ure [5] shows distributions of -Ecxtra for simulated signal 
and normalization events. Excellent agreement is seen 
in the two distributions, indicating that the efficiency 
of a cut on £7 e xtra will largely cancel when we measure 
the branching-fraction ratio; we observe the same level 
of agreement in the four D^*'£~ channels separately, as 
well as in the e and /j final states separately. 

We suppress hadronic events and combinatorial back- 
grounds by requiring |p miss | > 200 MeV/c, where |p miss | 
is the magnitude of the missing momentum. This re- 
quirement mainly rejects hadronic events such as B — > 
D^t:~ , where the ir~ is misidentified as a \x~ . Our selec- 
tion rejects more than 99% of B — > D^n^ background, 
while rejecting less than 1% of signal and other semilcp- 
tonic events. 



We further suppress background by requiring q 2 > 
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4 (GeV/c 2 ) 2 , where q 2 is calculated as 

Q 2 = [PB-PDM] 2 = [Pe+e~ -Ptag -PD(*)] 2 ■ (10) 

This requirement preferentially rejects combinatorial 
backgrounds from two-body B decays such as B — > 
J)WD, where one -D meson decays semileptonically (or, 
in the case of a Df, leptonically as Df — > t + v t ). Our 
selection rejects about 25% of these backgrounds, while 
the signal efficiency is about 98% because signal events 
automatically satisfy q 2 > m 2 w 3.16 (GcV/c 2 ) 2 . For 
B — > Dl~vi decays, the g 2 distribution peaks near zero 
(see Fig. [I]), so this selection has an efficiency of about 
60% for this normalization mode; for B — > D*£~vg de- 
cays, the q 2 distribution peaks at higher values, so our 
efficiency is about 70%. The q 2 requirement is the main 
reason why the reconstruction efficiency is different for 
signal and normalization modes, as seen below. 

If multiple candidate systems pass our selection in a 
given event, we select the one with the lowest value of 
-E/extra- This scheme preferentially selects the candidate 
that is least likely to have lost additional particles. The 
main effect of this algorithm is that a candidate in one 
of the D*£~ channels will be selected before a candidate 
in one of the D£~ channels when both candidates are 
present in an event. Because D* — > D feed-down is a 
dominant background while D — *■ D* fecd-up is compar- 
atively rare, keeping as many true D* events in the D*£~ 
reconstruction channels helps to increase the sensitivity 
to the Dt~v t signals. 

To improve the resolution on the missing momen- 
tum, we perform a kinematic fit [30( to all T(4S') — > 
B tag D(*H~ candidates. We constrain charged track 
daughters of K®, D, and B mesons to originate from com- 
mon vertices, and we constrain the T(4S) — > BB vertex 
to be consistent with the measured BABAR beamspot lo- 
cation. We constrain the mass of the signal D meson (and 
D* meson, if there is one) to the measured value [l6l |. and 
the combined momentum of the two B mesons to be con- 
sistent with the measured beam energy. 

D. D** Control Sample Selection 

We select four control samples to constrain the poorly 
known B — > D**(£~ /t~)V background. The selection is 
identical to that of the signal channels, but we require the 
presence of a ir° meson in addition to the B tag + D^*'i 
system. The 7r° candidate must have momentum greater 
than 400 MeV/c, and the event must satisfy E cxtla < 
500 MeV, where the two photons from 7r° — > 77 are 
excluded from the calculation of -Ecxtra- 

Most of the D** background in the four signal chan- 
nels occurs when the 7r° from D** — ► D(*'tt° is not re- 
constructed, so these control samples provide a direct 
normalization of the background source. Similar D** de- 
cays in which a ^ is lost contribute very little to the 
background since they do not have the correct charge 



TABLE I: Number of selected data events in the four sig- 
nal channels, 7V CV , and in the four D** control samples, 
Nd**cs- Here, the large m^ iss region is taken to be m^ iss > 
2 (GeV/c 2 ) 2 and corresponds to the region with greatest signal 
sensitivity. 



Channel N ev 


7V CV (large m^ ias 


) ATd»*cs 


D u r 1403 


121 


137 


D*°£- 790 


43 


77 


D+r 295 


36 


66 


D* + r 398 


14 


30 



correlation between the Stag and candidate, and 

decays with two missing charged pions, which may have 
the correct charge correlation, have very low reconstruc- 
tion efficiencies. The feed-down probabilities for the 
D**(£~ /t~)V background are determined from simula- 
tion, with uncertainties in the D** content treated as a 
systematic error as described in Sec. IIX A 31 

V. SELECTED EVENT SAMPLES 

After applying all of the criteria above, we select a to- 
tal of 3196 data events, 2886 in the four signal channels 
and 310 in the D** control samples, as listed in Tabled] 
Since most of the events at large are either Dt~V t 

or D*t~V t signal events, the third column in the Ta- 
ble gives a first indication of where our sensitivity comes 
from. There are more events in the two B~ channels, 
D°£~ and D*°£~, due to a larger efficiency to recon- 
struct charged -Btag candidates than neutral ones and, 
to a lesser extent, a larger efficiency to reconstruct D° 
mesons on the signal side than D + mesons. There are 
more events in the D channels than the D* channels, par- 
ticularly at large m 2 ^, because these channels contain 
both D mesons and D* feed-down. The greatest signal 
sensitivity therefore comes from the D°£~ channel. 

Figure [S] shows distributions of |p|| versus rn^^ for 
the selected data samples. One-dimensional distributions 
of w^iss and |p|| for these samples are shown when we 
discuss the signal fit in Section IVIII 

Figure [7] shows distributions of |p|| versus for 
several MC samples after applying all event selection cri- 
teria. While the composition of the event sample will 
be discussed in greater detail in the following section, 
these distributions exhibit the qualitative features of the 
data sample which are most relevant to our signal ex- 
traction. Figure [7Ja) shows D°£~Ve => D°£ ~ , where we 
introduce the =>■ notation to mean that these are true 
B~ — > D°£~T>g events reconstructed in the D°£~ channel. 
The w 2 n i ss distribution is very narrowly peaked around 
zero, as expected for one-neutrino events. Figure [7fb) 
shows D*°£~Vi D°£~, feed-down events where a D*° 
is misreconstructed as a D . In this case, the center 
of the m^jgg distribution is offset from zero, and this 
offset decreases with increasing |p||; this kinematic fea- 
ture is common to all feed-down processes, and is due 
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FIG. 6: Distributions of |p|| versus m^ iaa for selected data 
events in the four signal channels. 



to the fact that higher |p|| correspond to lower D* mo- 
menta and therefore to lower momenta for the lost tt° or 
7. The width of the rrt^^ distribution is also observed 
to decrease with increasing |p||, a feature which is also 
common to most distributions; this narrowing is partly 
due to the same kinematic effect as before, the reduced 
D* phase space at large |p||, and partly due to the fact 
that the lepton momentum resolution improves at higher 
momenta. 

Figures [7(c) and (d) show similar distributions for sig- 
nal events, (c) showing correctly reconstructed B~ — ► 
D°t~v t and (d) showing D*° feed-down; in both plots, 
the large values of w^iss due to the three neutrinos are 
clearly observed. Again, the distributions move 

towards zero and become narrower at high |p^|, in this 



case due to the reduced phase space for the multiple neu- 
trinos, although, in Fig. [7(d), the effect of the lost n° or 
7 can also be seen as a defecit along the lower-left edge 
of the distribution. Figure [7(e) shows feed-down from 
B — ► D**£~~Pg into the D°£~ channel, where, in addition 
to the neutrino, one or more 7r° mesons or photons from 
the D** decay have been lost. Since tt° mesons from 
D** decay typically have higher momentum than those 
from D* decay, the TO^ iss distribution is much broader 
than that in Fig. [7(b). Figure [7(f) shows the feed-up 
process D°£~Ve D*°£~, where a true D° meson is 
paired with a combinatorial tt° or 7 to fake a D*° can- 
didate. In this case, the m^ iss distribution is shifted in 
the opposite direction from Fig. [7(b). Figures [7(g), (h), 
and (i) show three additional distributions for events re- 
constructed in the D* n £~ channel, D* a £-V e , D*°t-V t 
signal, and D**£~vi background, respectively; each of 
these distributions is similar to the corresponding one in 
the D°£~ channel, Figs. [7(a), (c), and (e), respectively. 

Figures [7(j) and (k) show charge-crossfeed back- 
grounds: true B — > D(*'£~V£ events reconstructed with 
the wrong charge for both the -Btag and meson. Typ- 
ically this occurs when a low-momentum tt^ is swapped 
between the two mesons. Note that, even though the 
event is misreconstructed, this particle misassignment 
does not substantially alter the total missing momen- 
tum, so that the TO^lss distribution still peaks at or near 
zero. While the events in Fig. [7(k), which are recon- 
structed in the D* £~ channels, are very strongly peaked 
at m^ iss = 0, Fig. [7(j) includes a large feed-down com- 
ponent, and therefore exhibits the same sloping behavior 
seen in Fig. [7(b). 

Figure [7(1) shows the distribution for combinatorial 
background for all four signal channels. This back- 
ground is dominated by hadronic B decays such as 
B — * D^Ds that produce a secondary lepton, includ- 
ing events with r leptons from D s decay. 

In our BB MC sample, our criteria select D** control 
samples which are 60%-80% pure B — > D**£~vi events, 
of which more than 90% involve true D** -> £)W 7r tran- 
sitions. The remaining events are split between feed- 
up from B — > D^£~Ve and combinatorial background. 
In these control samples, the B — * D**£~vi component 
peaks at or near zero in rn^ iss , just as B — > D(*H~~Dt 
does in the four signal channels. The qualitative features 
of the other contributions are similar to what is seen in 
the signal channels. 



VI. KINEMATIC CONTROL SAMPLES 

The event selection criteria described in Section llVl are 
more complicated than those used in a typical BABAR 
analysis, due to the full-event reconstruction of a high- 
multiplicity final state and the need to veto events with 
extra tracks and neutral clusters. We use two data con- 
trol samples to validate our simulation with respect to 
the observed behavior in data. The control samples are 
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FIG. 7: (Color online) Distributions of |p|| versus m^ iss 
for several MC samples after all event selection. Red (light 
grey) regions indicate relatively high density of reconstructed 
events, while blue (dark grey) indicate relatively low density. 
Shown are (a) D°rv l =>■ D £~, (b) D*°rv e D°£~, (c) 
D°t~V t =>■ D°£~ , (d) D*°r-u T =>■ D°r, (e) D**£~V e => 

D°r, (f) D°rv e => D* r, ( g ) D*°rv e =>■ D*°r, (h) 

D*°t-V t D*°r, (i) D**l~Vi =>■ D*°£~, (j) charge- 
crossfeed reconstructed in the D°£~ and D + £~ channels, (k) 
charge-crossfeed reconstructed in the D*°£~ and D* + £~ chan- 
nels, and (1) combinatorial background in the four D^£~ 
channels. The reconstruction channel notation => and the 
features of these distributions are discussed in the text. 



kinematically selected, with no requirement on m^ iss , to 
be high purity samples of B — * D^*'l~Vn events, with 
little or no contamination from signal decays. 

The first control sample is defined by requiring the re- 
constructed lepton to satisfy |pj| > 1.5 GeV/c, and is 
therefore a subset of our full analysis sample. In simula- 
tion, 95% of the selected sample is B —* D^£~V£ (the 
two D*£~ channels are approximately 95% B — > D*£~T>e, 
while the two D£~ channels include both B — > Dl~vi 
and large feed-down from B — > D*l~vg). The remain- 
ing 5% of the sample is composed of about l%-3% 
B -> D**£~Vt, less than 1% B -> D^t~V t , and l%-2% 
of combinatorial backgrounds. 

For the second control sample, we remove the stan- 
dard q 2 > 4 (GeV/c 2 ) 2 selection and require that events 
instead satisfy q 2 < 5 (GeV/c 2 ) 2 , with q 2 calculated ac- 



cording to Eq. ([T0| . This control sample has very lit- 
tle overlap with our final event sample, where we re- 
quire q 2 > 4 (GeV/c 2 ) 2 . Although the two control sam- 
ples do have some overlap, this q- control sample has 
the advantage over the first of allowing us to examine 
events with low |p||, as expected for signal events. In 
simulation, approximately 90% of this control sample is 
B — > D^*'l~Vi (as in the first sample, the two D*£~ chan- 
nels are approximately 90% B — > D*l~Vt, while the two 
Dt~ channels include D* feed-down). The remainder of 
the sample is composed of about 3% B — > D**l~Vi, 3% 
B — * D(*^t~V t , and 4%-5% combinatorial backgrounds. 

Figure [8] shows several data-simulation comparisons in 
the two control samples. The four D^£~ channels have 
been combined in these plots as have the two control 
samples, and this union of the two control samples is re- 
sponsible for the large steps visible in (a) and (b). We 
see good agreement between data and simulation in these 
plots, as well as in similar studies where the two control 
samples are examined separately, the four D^*'t~ chan- 
nels are examined separately, the two lepton types are ex- 
amined separately, and where the data are split according 
to BABAR running period. We have examined variables 
related to Stag reconstruction, signal-side reconstruction, 
hermeticity and whole-event reconstruction, and missing 
momentum. In all cases, we observe that the simulation 
does a reasonable job describing the data. Because of the 
relative normalization scheme, small differences between 
simulation and data have no detrimental effect on the 
analysis. 



VII. FIT OF SIGNAL AND BACKGROUND 
YIELDS 

A. Fit Overview 

Signal and background yields are extracted using an 
extended unbinncd maximum likelihood fit to the joint 
( TO miss> Ip?I) distribution. The fit is performed simulta- 
neously in the four signal channels and the four D** con- 
trol samples. Two two-dimensional probability density 
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FIG. 8: Kinematic control sample plots: (a) \pi\; (b) q 2 ; (c) |p m i ss |; (d) m^ssl (e) -Ecxtra; (f) JTimiss m the £>* channels, assuming 
that the soft 71-/7 had been lost; multiplicity of (g) charged tracks and (h) neutral clusters used to reconstruct the -Stag- In 
all plots, the points with error bars are the data and the solid histogram is the simulation, scaled to the data luminosity. 
Good agreement is seen between data and simulation in a variety of variables corresponding to reconstruction, kinematics, 
and hermeticity requirements. Small differences between data and simulation cancel in the relative measurement and have 
no detrimental effect on the analysis. The large steps in (a) and (b) are due to the combination of two control samples, as 
described in the text. The structure in (g) is caused by the larger efficiency to reconstruct charged -Stag candidates — with an 
odd number of charged tracks — than neutral candidates, while the prominent even-odd structure in (h) is due to the fact that 
most neutral clusters correspond to the process ty° — » 77 and so appear in pairs. 



functions (PDFs) are presented in Section IVII Bl each 
component in the fit (listed below) is described by one of 
these two PDFs, with parameters determined from fits 
to simulated event samples. A set of constraints, de- 
scribed in Section fVII C[ relate fit components in differ- 
ent reconstruction channels. These constraints are also 
determined from MC samples, except for parameters de- 
scribing the amount of D* feed-down into the D£~ signal 
channels, which are determined directly by the fit to data. 

Tables [XT] and IIIII summarize the parameterization of 
the fit in the four signal channels and the four D** con- 
trol samples, respectively. In each of the four signal chan- 
nels, we describe the data as the sum of seven compo- 
nents: Dt-V t , D*t-V t , Dt-v u D*£~V t , D**(£-/t~)V, 
charge-crossfeed, and combinatorial background. The 
four D** control samples are described as the sum of five 
components: D**(£~/r~)V, D(£-/t~)V, D*{£-/t~)V, 



charge-crossfeed, and combinatorial background. Each 
of these components is described by one of the two PDFs 
given in Section [VII Bl with the numerical parameters of 
the 32 PDFs determined from independent MC samples. 
The charge-crossfeed components in the two D£~ signal 
channels are described by a single PDF, with common 
parameters for D°£~ and D + £~, as are the two D*£~ 
charge-crossfeed components and the four D^i^n com- 
ponents; the four combinatorial background components 
in the signal channels arc described by a single PDF with 
common parameters, as are the four in the D** control 
samples. 

B — ► D(*^t~V t events feeding up into the D** control 
samples are expected to contribute 1.8 ± 0.6 events in 
the four channels together, so these events are combined 
with the light lepton contribution. In both the control 
samples and in the signal channels, B — > D**t~T> t events 
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TABLE II: Components of the signal extraction fit in the 
signal channels, and their approximate abundances in our BB 
MC sample. The structure of the fit is identical between the 
B~ and B° channels. There are seven components in each of 
the four signal channels. 



Abundance in 



Channel Source BB MC (%) 





D* u t~T7 t signal 


5 




D°t~V t signal feed-up 


0.5 




D*°t~Vi normalization 


90 




D°rv e feed-up 


2 




D** '(IT j 't~)V feed-down 


3 




Charge-crossfeed 


0.5 




Combinatorial background 


1 




D u t~V t signal 


3 




D*°t~V t signal feed-down 


3 




D°£~Vf normalization 


25 




D*°£~Vi feed-down 


60 




D**l~Vi feed- down 


2 




Charge-crossfeed 


2 




Combinatorial background 


o 

z 


D* + £~ 


D* + t~V t signal 


5 




D + t~V t signal feed-up 


0.1 




D* + t~Vi normalization 


90 




D + £~Ve feed-up 


0.5 




D*~*l~T>i feed- down 


3 




Charge-crossfeed 


0.1 




Combinatorial background 


2 


D+e~ 


D + t~V t signal 


5 




D* + t~T> t signal feed-down 


2 




D + £~Ve normalization 


45 




D* + £~Vt feed-down 


40 




D**l~Vi feed- down 


6 




Charge-crossfeed 


1 




Combinatorial background 


3 



are expected to contribute 3.5%-4.5% of the total D** 
yield; these events are combined with the light lepton 
contribution, and the amount of D**t~v t is varied as a 
systematic uncertainty. 

The fit has 18 free parameters: four signal branching- 
fraction ratios R, one for each meson; four B — > 
D^*H~Vt normalization yields; four B — > D**l~T>i back- 
ground yields; four combinatorial background yields, one 
in each of the four D** control samples; two parame- 
ters describing D* =>• D feed-down, one for charged B 
modes and one for neutral B modes. The combinatorial 
background yields in the four signal channels are fixed in 
the fit to the expected value from simulation, as are the 
charge-crossfeed backgrounds in both the signal channels 
and D** control samples; variation of these backgrounds 
is treated as a systematic uncertainty below. 

We also perform a second, B~-B° constrained, fit, by 



TABLE III: Components of the signal extraction fit in the D** 
control sample channels, and their approximate abundances 
in our BB MC sample. The structure of the fit is identical 
between the B~ and B° channels. There are five components 
in each of the four D** control sample channels. 







Abundance in 


Channel 


Source 




V ir £ 


D**£~Vg 


60 




D*°£~V e feed-up 


18 




D°£~Ve feed-up 


2 




Charge-crossfeed 


1 




Combinatorial background 


on 


r>0 (1/7- 

V 7T £ 


D**t~Vi 


70 




D*°£~V e feed-up 


10 




D°£~Ve feed-up 


3 




Charge-crossfeed 


2 




Combinatorial background 


15 


d*+kH- 


D**£-V e 


65 




D* + l~Vi feed-up 


20 




D + t"Di feed-up 


0.1 




Charge-crossfeed 


0.1 




Combinatorial background 


15 




D*Tvt 


60 




D* + l~vi feed-up 


6 




D + £~Vi feed-up 


3 




Charge-crossfeed 


1 




Combinatorial background 


30 



requiring R(D+) = R(D°) and R(D*+) = R(D*°), 7 re- 
ducing the number of free parameters to 16. 



B. Probability Density Functions 

We construct an empirical model of the two- 
dimensional (m^ niss , \Pf\) PDF as the product of two 
terms: a one-dimensional function to describe the |p|| 
distribution, discussed in Section I VII B 11 and a |p||- 
dependent "resolution" function to describe the rn^^ 
distribution, to be discussed in Section [VII B 21 For pro- 
cesses in which the only missing particle is a single neu- 
trino, the true spectrum is a delta function located 
at zero and the observed distribution is a pure resolution 
function. For components with multiple missing parti- 
cles, the observed rn^^ distribution is the convolution of 
the physical nif^^ spectrum with our detector resolution. 
The PDFs presented below are used to describe both of 
these physical cases, with different numerical parameters 
describing the different behaviors; these two PDFs are 
flexible enough to describe the variety of physical and 



7 This constraint follows from isospin symmetry in both the signal 
and normalization modes but is more general. 



17 



resolution processes needed in this analysis. 



One-dimensiona 



Parameterization 



We use a generalized form of a Gaussian to model the 
|p|| distribution. The Gaussian distribution, 



model of the lepton spectrum above by a "resolution" 
function in m^ niss , where the resolution is a function of 
|p||. Allowing the parameters of the resolution function 
to be functions of |pj| produces a correlation between the 
two fit variables, and it is these parameters which allow 
the PDFs to describe such a wide variety of shapes. 

Using the model of the lepton spectrum W(|p||) intro- 
duced above, we construct the PDFs as: 



G(\pi\) oc exp 



Pa 



(11) 



has the same general properties as our distributions: it 
rises smoothly from zero to a peak value and then falls 
smoothly back to zero again. Here, po represents the 
value of |p|| for which Q peaks and a represents the width 
of the Gaussian distribution. 

This gross agreement is not enough, however, so we 
define a modified Gaussian function, 



Pl(|p!l,<x S s) = ft(|p!|) X 



/i(Ip?I)Si(Ip?I 



s) + 



(i-/i(|pJI))ft(|p2l;"&J n- r »'i 



and 



H(|p||) oc exp 



Po 



ff(|pil) 



"(ip?d n 



(12) 



where, for convenience, we have absorbed the constant 
factor of 2 into the definition of cr(|pj|). By allowing the 
width and exponent of the Gaussian to be functions of 
|p||, we are able to describe a greater variety of shapes. 
Specifically, we take f(|pj|) to be a linear function, 



IpJI) 



vh - VL 
2.4GeV/c 



(13) 



where i>l and vh are the values of the exponential term 
at the low and high endpoints of |p||, fixed at zero 
and 2.4GeV/c, respectively. Similarly, we parameterize 
(t(|p||) as a bilinear function, 



<|P||) = 



Po 



<?o 



2.4GeV/c-j9o 



IpII < po 

(\p*e\ -Po) IpH >Po 

( 14 ) 

where ox, &o, and <th represent the widths of the Gaus- 
sian at |p|| = 0, |p|| =po, and |p|| = 2.4GeV/c, respec- 
tively. Even though this parameterization is discontinu- 
ous at the point |p|| = p , the resulting function 7Y(|p||) 
remains smooth since the numerator in the exponent, 
(|p|| — po), goes to zero at the same point. 

The |p|| parameterization therefore has six free pa- 
rameters: po, the peak; vr, and vh , describing the expo- 
nential term; and <tl, ctoi an d &h, describing the width. 
When performing fits using this PDF, we integrate Tt 
numerically to compute the normalization. 



2. Two-dimensional PDF Parameterization 



We construct two types of two-dimensional PDF, 
7M|p|l>"& iss ) and ^Mlpll^miss) b y multiplying the 



■P2(\pt\,m 2 miss ) = H(\p* e \) x 



/i(|P/l)0i(|pJ|;m 



2 - ) 

miss / 



s) + 



(l-/i(|pj|)-/a(|p;i))&(|pj|;n^i»)] • (16) 

Here, the functions Q\ and Qi are Gaussians and Qb is 
a bifurcated Gaussian (Gaussian with different a param- 
eters on either side of the mean), respectively; all are 
functions of m^ iss , with parameters dependent on |p||. 

The \p* t \ dependence of the various parameters of Gi,2 
and Qi, is listed in Table IIVI The total number of free 
parameters for V\ is 18: six for 7i(|p||), five each for 
Qi and Q2, and two for f\. The total number of free 
parameters for Vi is 24: six for H(|p||), five each for Q\ 
and Q2, four for Qb, and two each for /1 and f%. 

We use the simpler PDF, V\, to model most of the 
scmileptonic fit components (22 out of 32), as well as 
the charge-crossfeed and combinatorial backgrounds. For 
the remaining ten components, however, the more com- 
plicated parameterization is required to adequately 
describe the to^; ss tail. Eight of these components are 
the ones in which the only missing particle is a single 
neutrino, 



D*°£- 
D*+t 
D* 
D* 



D*°e~ 

- D*+t 

D*°TT°, 

D*+Tr° 



D°£-V e => D°£- 
D+£-V e => D+l- 
D**£-V f => D°tt°£- 



D* 



and the remaining two are components in which a single 
neutrino and a soft ir° or 7 are missing, 



D*°£~V e => D°£- D*+t 



Vf 



D+t 
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TABLE IV: |p|| dependence of the m^iss PDF parameteri- 
zaion. The form of f-z is chosen to allow the Gb term to con- 
tribute at low |p||, but to drive this term rapidly to zero as 
|p£ | increases. The form of oh is chosen to allow for a long tail 
towards high vn^ ss at low |p||, but to drive this term rapidly 
to zero as |pj| increases (note that there is no problem having 
a approach zero since the amplitude of this term goes to zero 
as well; the result is finite and well-behaved). 7V pa r gives the 
number of free parameters for each term separately. 



Function Parameter Dependence on |p|| 




Gl,2 
Si, 2 


mean 
a 


quadratic 
linear 


3 
2 


Gb 
Gt 


mean 

<?L 


constant 
constant 


1 
1 


Gb 


on 


o~ho • 


1 ( Ip2I V 


2 


Pl,2 


h 


linear 




2 


V-2 


h 


p /2.4GeV/c-| P ;iV 
^ 2.2GeV/c J 


2 



C. Crossfeed Constraints 

We apply a number of constraints in the fit, relating 
the event yields between different reconstruction chan- 
nels in order to make use of all available information. 
These constraints help to maximize our sensitivity, par- 
ticularly to the B — > Dt~T> t signals where the dominant 
backgrounds are due to feed-down. There are 20 such 
constraints in the fit, corresponding to 20 different ways 
in which a true B — > D/ D* / D**£~Vi event can be re- 
constructed with the wrong final-state meson, either as 
feed-down (D* =4> D and D** =>• D/D*) or as feed-up 
(D D*/D^tt° and D* DWtt ). 

These constraints are implemented in the fit by requir- 
ing that the number of events of type j correctly recon- 
structed in the i th channel (iVy ) is related to the number 
of events of type j reconstructed in a crossfeed channel 
*' (Ni<j) by 



Nij-U 



(17) 



where fi^i'j is a crossfeed constraint relating the two 
yields. The crossfeed constraints ft^i'j are linearly re- 
lated to the misreconstruction probability. For feed- 
down processes, in which the probability to lose a low- 
momentum 7T° or 7 is high, fi—tyj typically takes values 
between 0.2 and 1.0; for feed-up processes, in which the 
probability to reconstruct a fake it or 7 in a narrow mass 
window is low, fi—>i'j typically takes values between 0.01 
and 0.1. 

The values for most of the fi-*i>.j terms are taken 
from simulation, but, in order to reduce systematic ef- 
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FIG. 9: Projections of the PDF from fits to MC samples. 
The left plots show projections onto m^^, while the right 
plots show projections onto |pj|. Shown are projections for 
four of the PDFs used in the fit: (a) D*°l~Vi => D*°£~, 
(b) D*°rv e D°r, (c) D* t~V t => D*°r, and (d) 
D**£~Ve => D*°£~ . The MC sample is shown as points, and 
the projection of the fit is shown as a curve. Note the sharp 
peak at m^ iss = in (a), while the peak in (b) is somewhat 



spread out and shifted to larger values of m n 
the lost 7T° or 7 from D*° decay. 



because of 



fects, the values of the dominant feed-down components, 
B — » D*£~vi reconstructed in the D£~ signal channels, 
are left free in the fit to data. We also use the float- 
ing values of these D* feed-down constraints to apply a 
small first-order correction to the corresponding signal 
feed-down constraints describing B — > D*t~V t recon- 
structed in the D£~~ channels; in this way, we use the 
high-statistics D*l~Vg samples to improve our knowledge 
of the signal feed-down probability. 



D. Projections of the Fit to Data 

Figures [TUrTHl show projections of the B~-B°- 
constrained fit. Figure 1101 shows projections in m,^ 
for the four signal channels, showing both the low 
region, which is dominated by the normalization modes 
B — > D^£~T7(, and the high w^ iss region, which is dom- 
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FIG. 10: (Color online) Distributions of events and fit projec- 
tions in m^jjgg for the four final states: D*°£~ , D°£~ , D* + £~ , 
and D + £~ . The normalization region m^ iss fa is shown with 
finer binning in the insets. The fit components are combinato- 
rial background (white, below dashed line), charge-crossfeed 
background (white, above dashed line), the B — > D£~Ve nor- 
malization mode (// hatching, yellow), the B — * D*£~Vt nor- 
malization mode (\\ hatching, light blue), B — > D**t"vi 
background (dark, or blue), the B —* Dt~V t signal (light 
grey, green), and the B — > D*t~V t signal (medium grey, ma- 
genta). The fit shown incorporates the B~-B° constraints. 



inated by the signal modes B — > D^*'r~V T . Figures [TT] 
and ll2l show projections in |pj| for the normalization and 
signal regions, respectively, and Figs . [T51 and [HI show pro- 
jections of both m^ iss and |p|| for the four D ** control 
samples. In all cases, we see that the fit does a reason- 
able job of describing the observed event sample, both in 
background-dominated and signal-dominated regions. 




FIG. 11: (Color online) Distributions of events and fit pro- 
jections in |pj| for the four final states D*°£~ , D°£~ , D* + £~ , 
and D + £~, shown in the normalization region, m^iss < 



1 (GeV/c 



The fit components are shaded as in Fig. [KM 



VIII. 



SIGNAL EXTRACTION AND 
NORMALIZATION 



The fit described in Section IVlII directly measures, for 
each signal mode, the ratio of the number of signal events 
in the data sample, N s i s , to the number of correspond- 
ing normalization events, N norm . We measure the signal 
branching-fraction ratios R as 



R : 



1 



£ sig/<Sn 



I V£V T ) 



(18) 



where the relative efficiency e s ig/e n orm is calculated from 
signal MC samples as 
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FIG. 12: (Color online) Distributions of events and fit pro- 
jections in |pj| for the four final states D*°£~ , D°£~ , D* + £~ , 
and D + l~ , shown in the signal region, m^ iss > 1 (GeV/c 2 ) 2 . 
The fit components are shaded as in Fig. [TO] 



FIG. 13: (Color online) Distributions of events and fit pro- 
jections in m^igg for the four D** control samples D*°n £~ , 
D°TY°r, D t+ TT°r, and D + ir°r. The fit components are 
shaded as in Fig. \W\ 



iV sig 



A/rcco /A/S on 
Jv norm/ iv nOTm 



(19) 



Here, the N gen are the numbers of simulated events, and 
the jV reco are the numbers of reconstructed events, in- 
cluding both correctly reconstructed events and contri- 
butions from feed-up or feed-down. Crossfeed is not a 
large effect, however, because both the numerator and 
denominator in this relative efficiency receive crossfeed 
contributions, and the net result tends to cancel (this 
cancellation is not exact, since the D* momentum spec- 
tra are not identical between signal and normalization 
modes, but these differences are already accounted for in 
our normalization procedure). 



Signal efficiencies are given in Table [Vj The relative 
efficiencies for the two B — > Dt~V t modes are much 
larger than unity because of the q 2 cut, which is » 98% 
efficient for signal events but rejects about 50% of the 
B — > D£~T>i normalization events, as seen in Fig. [lja). 
The q 2 cut has a similar, but less pronounced, effect on 
the D* modes, but, due to the lower efficiency for identi- 
fying secondary leptons in the signal modes, the resulting 
relative efficiency is close to unity. 



IX. SYSTEMATIC UNCERTAINTIES 

Table IVII summarizes all of the systematic uncertain- 
ties considered in this analysis. Because our signal is ex- 
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FIG. 14: (Color online) Distributions of events and fit pro- 
jections in |p|| for the four D** control samples D*°ir°£~, 
D°ir°r, D* + TT°r, and D + TY°r. The fit components are 
shaded as in Fig. \W\ 



tracted and normalized relative to B — > D^£~ve, many 
sources of systematic uncertainty — especially those re- 
lated to reconstruction efficiency — are expected to cancel, 
either partially or completely, when we take the ratio. 

We describe the individual contributions to the sys- 
tematic uncertainty below. We divide the systematics 
into two broad categories: additive and multiplicative. 
Additive systematic uncertainties are those which affect 
the fit yields and therefore reduce the significance of the 
measured signals. Multiplicative uncertainties affect the 
normalization of the signals and the numerical results but 
not the significance. 



A. Additive Systematic Uncertainties 

In order to estimate additive systematic uncertainties, 
we perform an ensemble of fits to MC event samples. For 
each source of uncertainty, we perform a number of tests 
where we modify, as appropriate, the fit shapes, crossfeed 
constraints, and the combinatorial background yields (all 
of which are fixed to MC-derived values in the nominal 
fit) and perform a signal fit. By doing a large number of 
such tests and studying the distribution of fit results in 
these ensembles, we are able to estimate the systematic 
uncertainties. In all of these ensembles, we take the RMS 
of the observed distribution, relative to the corresponding 
mean fit value, as the systematic uncertainty. 



1. Monte Carlo Statistics 

In order to study the systematic uncertainties due to 
limited Monte Carlo statistics, we perform two ensembles 
of fits. In the first ensemble, we perform a variation of 
the PDF shapes. Each of the 37 PDFs are independently 
varied by generating new values for each of the 18 or 24 
shape parameters according to the uncertainties in the 
PDF fit, taking into account correlations between the 
fitted parameters. In the second ensemble, we vary each 
of the feed-up and feed-down constraints according to 
their statistical uncertainties. 

Figure [15] shows distributions of fit results for the en- 
semble of PDF shape fits. 



TABLE V: Relative signal efficiencies e s ig/e n orm for the four 
signal modes. 



Sig 


nal mode 




^-sig/^norm 


B~ 


- D\- 


Vr 


1.85 ±0.02 


B- 


-» D*°T 


~Vt 


0.99 ±0.01 


B l > 


-> D+T' 


~'r 


1.83 ±0.03 


B° 


-^D*+T 




0.91 ±0.01 



2. Combinatorial Background Modeling 

Table IVlTl summarizes the physical sources of combina- 
torial background considered in this analysis, including 
their approximate abundances in our BB MC sample 
after all event selection. In order to study systematic ef- 
fects, we perform an ensemble of fits, reweighting events 
from the various combinatorial sources. 

In total, the two-body B decays B -► D^+dMM 
and B — > £)(*)£>(*) constitute approximately 45% of the 
total combinatorial background yield, while the three- 
body decays B — > D^D^K contribute another 15%. 
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TABLE VI: Contributions to the total systematic uncertainty. The additive systematic uncertainties represent uncertainties 
on the fit yield, and therefore reduce the statistical significance of the results. The multiplicative systematic uncertainties 
represent uncertainties on the normalization, so they affect the numerical results but not the statistical significance. The first 
four columns summarize errors on the individual branching-fraction ratios; the last two columns summarize errors on the B~- 
B° constrained measurement. The totals here refer to errors on the branching-fraction ratios R; the errors on B(B ~* D^rVe) 
(discussed in Section [X) only apply to the absolute branching fractions, and are not included in the quoted total error. 



Source 




D U T1S 


Fractional uncertainty (%) 

D+TU D* + TV 


Dtp 


D'tv 








Additive systematic uncertainties 






MC stat. (PDF shape) 


11.5 


8.4 


4.5 


1.8 


6.9 


4.7 


MC stat. (constraints) 


4.2 


1.9 


6.1 


1.3 


3.6 


1.4 


Comb. BG modeling 


7.5 


4.1 


11.5 


2.6 


9.1 


2.9 


D** modeling 


5.7 


0.5 


1.6 


0.2 


3.0 


0.4 


B — > D* form factors 


1.9 


0.7 


0.8 


0.2 


1.4 


0.4 


B — > D form factors 


0.2 


0.7 


0.6 


0.2 


0.3 


0.4 


2 -1 IT 

m miss tail modeling 


1.5 


0.5 


1.2 


0.4 


1.6 


0.1 


7T° crossfeed constraints 


0.5 


1.1 


0.5 


0.9 


0.5 


1.0 


D** feed-down 


0.4 


0.1 


0.1 


0.3 


0.2 


0.2 


D**t~v t abundance 


0.4 


1.3 


0.3 


0.2 


0.3 


0.8 


Total additive 


15.6 


9.7 


14.0 


3.6 


12.5 


5.8 








Multiplicative systematic uncertainties 






MC stat. (efficiency) 


1.2 


1.1 


1.5 


1.1 


1.0 


0.8 


Bremsstrahlung/FSR 


0.6 


0.5 


0.3 


0.4 


0.4 


0.5 


Tracking e 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


e PID e 


0.5 


0.5 


0.6 


0.6 


0.6 


0.6 


H PID e 


0.5 


0.6 


0.7 


0.6 


0.6 


0.6 


K PID g 


0.2 


0.1 


0.2 


0.0 


0.2 


0.0 


TV PID £ 


0.1 


0.1 


0.2 


0.0 


0.1 


0.1 


K° S £ 


0.1 


0.0 


0.1 


0.1 


0.1 


0.0 


Neutral (ir° and 7) £ 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


Daughter B's 


0.1 


0.3 


0.0 


0.1 


0.1 


0.3 


B(r~ -> l-Viv T ) 


0.2 


0.2 


0.2 


0.2 


0.2 


0.2 


Total multiplicative 


1.6 


1.5 


1.8 


1.4 


1.4 


1.3 


Total 


15.6 


9.9 


14.0 


3.9 


12.5 


6.0 


B(B -> D^H-Ve) 


10.2 


7.7 


9.4 


3.7 


6.8 


3.4 



TABLE VII: Sources and approximate abundances of combinatorial background in our BB MC sample. All four signal channels 
are combined here. The third and fourth columns show what fraction of the B decays in each group have previously been 
observed. The fourth column is the product of the second and third, and indicates how much of the estimated combinatorial 
background is known from other measurements. 



Source 


% of total BG % B measured 




(relative) (absolute) 



B D { a * )+ D M ( + light hadrons) 



. . . with DJ — ► tv 


30 


90 


27 


. . .with Df -> £u((t>/r]/rj') 


10 


90 


9 


B -> DWl)W ( + light hadrons) 


35 


65 


25 


Both B -» D ( *kv 


15 


100 


15 


Fake lepton 


5 








Other misreconstructed 


5 








Total 






75 



23 




R(D) R(D*) 

FIG. 15: Distributions of fit results for systematic uncer- 
tainties due to Monte Carlo statistics, shown for (a) B~ —+ 
D t~V t , (b) B~ -> D* t~V t , (c) B° -> D + t~V Ti (d) 
B o _^ d « +t --^^ ^ B _^ Dt - Vt ^ and ^ B j)* r "F r . 

In all figures, the branching-fraction ratio R is shown. 

Branching fractions of most of the relevant two-body B 
decays (and some of the three-body decays as well) have 
previously been measured. These branching fractions are 
listed in Table IVIII1 along with relevant branching frac- 
tions of the Df meson. 

To study systematic uncertainties related to combina- 
torial background modeling, we perform an ensemble of 
fits. In each fit, we reweight events in the simulation. 
For modes listed in Tabic rvTTT| we reweight the branching 
fraction, generating random weights from a Gaussian dis- 
tribution based on the measured value (for decays involv- 
ing a Df meson, the weight is the product of weights for 
both the B and D+ decays). For charge-crossfeed events 
(true B — >■ D^i~T>i events where the B ta , g and signal 
Z)(*) swap a charged particle), the dominant systematic 
uncertainty is not the branching fraction, but rather the 
efficiency to reconstruct the B tag with the wrong charge. 
We estimate a 10% uncertainty on the modeling of this 
process, i.e., we generate weights for these events using 
a Gaussian with a mean of 1 and a width of 0.1. For 
doublc-semileptonic events, with both B mesons decay- 
ing to D^t ~ve, again, the dominant uncertainty comes 
from the probability to misreconstruct a B ta , g candidate 
in this event, and we assume a 10% uncertainty on this 
number as well. For events in which the signal lepton 
is misidentified, we assign a 10% uncertainty; the typ- 
ical fake rate measured in data is 2%-3%, with data- 
simulation discrepancies generally 10% or less in the mo- 
mentum ranges of interest. For all remaining sources 
of combinatorial background, including high-multiplicity 



TABLE VIII: Branching fractions of and two- and three- 
body B decays contributing to combinatorial background. 
Measurements are taken from |3l| . except (f) which are taken 
from [33. The last column gives the branching fraction used 
to generate the BABAR MC sample, where each number is 
shown in the same scale as the corresponding number in the 
second column. 



Mode 






B 








MC 


D+ 




TV 




( 6.4 ± 1.5 ; 


X 


10" 


J 


7.0 


D+ 


— » 


Tjlv 




( 2.5 ± 0.7 ; 


X 


10" 


-2 


2.6 


D+ 




n'lv 




( 8.9 ± 3.3 ; 


X 


10" 


-3 


8.9 


D+ 




4>tv 




( 2.0 ± 0.5 ; 


X 


10" 


-2 


2.0 


D+ 








( 5.0 ± 1.9 ; 


X 


10" 


-3 


4.6 


B+ 





D"Df 




( 1.3 ± 0.4 ; 


X 


10- 


2 


1.06 


B + 




D°D* S + 




( 9 ± 4 ; 


X 


10" 


-3 


9.1 


B + 




D*°D+ 




( 1.2 ± 0.5 ; 


X 


10" 


-2 


1.02 


B + 




D* D* S + 




( 2.7 ± 1.0 ; 


X 


10" 


-2 


2.28 


R+ 




D **o D (*)+ 




( 2.7 ± 1.2 ; 


X 


10" 


-2 


3.0 




— > 


D°D* + K° 




( 5.2 ± 1.2 ; 


X 


10" 


-3 


5.2 


13 




D*°D*+K° 




( 7.8 ± 2.6 " 


x 


10" 


-3 


7.8 


D + 

B 




D° D°K+ 




(1.37 ± 0.32 ; 


x 


10" 


-3 


1.9 


B+ 


— > 


D*°D*°K+ 




( 5.3 ± 1.6 N 


x 


10" 


-3 


5.3 


B+ 




D*°D°K + 




( 4.7 ± 1.0 ; 


X 


10" 


-3 


4.8 


B+ 




D*~D + K+ 




( 1.5 ± 0.4 ; 


X 


10" 


-3 


0.5 


B [) 




D~Df 




( 8.0 ± 3.0 ; 


X 


10" 


•>> 


7.4 


B° 




D*~D+ 




(1.07 ± 0.29 ; 


X 


10" 


-2 


1.03 


B° 




D~D* + 




( 1.0 ± 0.5 ; 


X 


10" 


-2 


0.74 


B° 




D*~D* S + 




( 1.9 ± 0.5 ; 


X 


10" 


-2 


1.97 


B° 




D~D°K+ 




( 1.7 ± 0.4 ; 


X 


10" 


-3 


1.7 


B° 




D-D*°K+ 




( 3.1 ± 0.6 ; 


X 


10" 


-3 


3.1 


B° 




D *- D *o K + 




(1.18 ± 0.20 ; 


X 


10" 


-2 


1.18 


B° 




D*-D + K° 




( 6.5 ± 1.6 ; 


X 


10" 


-3 


8.1 


B° 




D*~D*+K° 




( 8.8 ± 1.9 ; 


X 


10" 


-3 


8.8 


B° 




D* + D*~ 


(t) 


( 8.1 ± 1.2 ; 


X 


10" 


4 


8.3 


B° 




D+D*- 


(t) 


(10.4 ± 2.0 ; 


X 


10" 


-4 


6.7 


B° 




D+D~ 


(t) 


( 2.8 ± 0.7 ; 


X 


10" 


4 


2.7 



B Dhh(h . . .) and B — > DDhh(h . . .) (where D here 
represents any charm meson and h any light meson) and 
other misreconstructed events, we assume a 50% uncer- 
tainty in the relevant rates. 

In each test, we fit the reweighted MC sample to gen- 
erate new PDF shapes and recalculate the expected yield 
of combinatorial events in each channel. Figure [TBI shows 
the effect of this reweighting on the combinatorial BG 
PDF in the signal channels. We note that the reweight- 
ing affects the normalization of the charge-crossfeed back- 
grounds but not the shape. 



3. D** Modeling 

We generate an ensemble of D** models, sampling 
from the distribution in Table ILXl This model is based on 
the current world averages [la , |28| but includes informa- 
tion from selected recent measurements [33j and imposes 




|p|| (GeV/c) 

FIG. 16: Combinatorial background modeling variation for 
the four signal channels, showing the projections onto (a) 
TO miss) an d (b) |p||. In both figures, the MC sample without 
reweighting is shown as data points, while the light and dark 
shaded regions show the la and 2a envelopes of the ensemble 
of reweighted PDF shapes. 

isospin symmetry between charged and neutral B modes. 

For each test, we generate random numbers for the six 
exclusive modes (D, D*, and the resonant D** states), 
independently for B + and B° decays. We then saturate 
the remaining inclusive b — > ct~vt rate with the four 
nonresonant states, maintaining the Monte Carlo ratio of 
0.1 : 0.3 : 0.2 : 0.6. Even though we are only interested in 
the B — > D**£~T/£ states, we need to generate distribu- 
tions of the B — > D^*'£~V£ branching fractions to allow 
for sufficient variations in the nonresonant states which 
are used to saturate the total rate. 

For each test, we reweight both the D**l~Vt PDFs 
and crossfeed constraints to estimate the systematic un- 
certainty. 

4- B — > £)W Form Factors 

We reweight the form factors of both signal B — > 
D^t~v t and normalization B — > D^t^vg decays. In 
both cases, we use the form factor parameterization of 
Caprini, Lellouch, and Neubert (27j . with numerical pa- 
rameters given in Section IIII1 We reweight signal and 
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TABLE IX: B — > X c (~T>e branching fractions used in the D** 
modeling systematic study. The first line, civ, represents the 
inclusive semileptonic branching fraction. For the six lines 
representing the D, D* , and D** resonant states, the distri- 
bution of these branching fractions is taken to be Gaussian 
with the given mean and width. For the last four lines, rep- 
resenting the nonresonant D** states, the ranges of variation 
are not shown in this table; their distribution is determined 
by the inclusive rate and the other exclusive modes, as de- 
scribed in the text. The generated branching fractions, B gcn , 
are the same for charged and neutral B mesons. All numbers 
are in %. 



Mode 




B{B°) 




B(B~ 


) 






(* 


a 




a 


civ 


10.4 


10.17 





10.9 





D 


2.10 


2.14 


0.14 


2.29 


0.16 


D* 


5.6 


5.54 


0.25 


5.94 


0.24 


Lh. 


0.56 


0.47 


0.08 


0.58 


0.06 


Dl 


0.37 


0.35 


0.07 


0.46 


0.08 


Do 


0.20 


0.46 


0.09 


0.45 


0.09 


D[ 


0.37 


0.85 


0.20 


0.83 


0.20 


D*n° 


0.1 


0.03 




0.029 




Dtt 


0.3 


0.09 




0.088 




D*n ± 


0.2 


0.06 




0.058 






0.6 


0.18 




0.175 





normalization modes simultaneously and generate new 
PDFs, crossfeed constraints, and relative efficiencies. 

5. JTimiss Tail Modeling 

Studies in the two kinematic control samples show ac- 
ceptable overall agreement between data and simulation 
for the m^ iss resolution [see Fig. 0(d)], but suggest that 
the simulation may underestimate the ratio of the num- 
ber of events in the large rn^iss tail region to the num- 
ber of events near m^ iss = 0. We estimate that this 
tail component of the resolution may be underestimated 
by up to 10%. We study systematic effects related to 
this by reweighting events at large Ti^ iss , greater than 
1 (GeV/c 2 ) 2 , up by 10%, modifying the PDF shapes for 
B -> Di-y t and B -> D^£~Ve. We perform a fit with 
these modified PDFs and take the difference from the 
nominal fit as a systematic uncertainty. 

6. 7T° Efficiency and Crossfeed Constraints 

While the systematic uncertainties due to detector ef- 
ficiencies (described in more detail in Section IIX B 3p 
are primarily multiplicative, the efficiencies for tt° re- 
construction have a large impact on the feed-down ef- 
ficiencies and therefore the fit yields. This effect can be 
enhanced by the fact that the feed-down constraints are 
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defined as the ratio of the number of events reconstructed 
in the D£~ channel to that in the D*£~ channel, which 
move in opposite directions as the tt° efficiency is varied. 

We generate an ensemble of fits by varying the tt° effi- 
ciency within its uncertainty, 3.0% per ir°. The resulting 
changes in the feed-down constraints for both signal and 
background modes are propagated through the signal fit 
to estimate the resulting systematic uncertainties. 

7. D**£~V e Feed-down 

We assign an additional systematic uncertainty on 
D**£~~p£ feed-down rates due to the fact that the 7r° 
mesons involved in feed-down processes typically have 
low momentum, while the 3.0% systematic uncertainty 
mentioned above is derived from a control sample with a 
broad spectrum. Since we float the constraints describing 
D* D feed-down in the fit, D* feed-down processes are 
insensitive to systematic effects due to the 7r° efficiency 
at low momentum. The D** feed-down constraints, how- 
ever, are taken from simulation and can therefore be af- 
fected. 

We compare the fitted values of the D* => D feed-down 
rates to the simulation to estimate that the efficiency for 
low-momentum ir° mesons is correctly modeled to within 
10%. We generate an ensemble of fits in which we vary 
the 7T reconstruction efficiency ±10% for ir° mesons with 
momentum less than 300 MeV/ c. We generate new PDFs 
and feed-down constraints which we propagate through 
the signal fit to estimate the systematic uncertainties. 

8. B — > D**t~V t Abundance 

We vary the fraction of B — ► D**t~V t events in 
the D** samples by generating random numbers from 
a Gaussian distribution with mean 1.0 and width 0.3, 
equivalent to a ±30% variation. For each test, we gener- 
ate new PDFs and crossfeed constraints to estimate the 
systematic uncertainty. 

B. Multiplicative Systematic Uncertainties 

1. Monte Carlo Statistics 

The dominant multiplicative systematic uncertainty is 
due to limited Monte Carlo statistics. The various MC 
samples are independent of one another, so that there is 
no cancellation between the signal and normalization. 

2. Bremsstrahlung and Final-State Radiation 

Based on a control sample of identified electrons and 
studies in MC samples, we estimate the uncertainty on re- 
construction efficiency due to Bremsstrahlung and final- 



state radiation effects to be 2.1%. This uncertainty ap- 
plies to both signal and normalization modes, however, 
and so the effect on the relative efficiency is expected 
to cancel. The fractions of events in which a photon 
is radiated are nearly the same between signal and nor- 
malization modes, within statistical precision of 10%; we 
therefore treat the uncertainty between signal and nor- 
malization modes as 90% correlated to calculate the final 
systematic uncertainty. 

3. Detector Efficiencies 

We estimate systematic uncertainties related to the de- 
tector efficiencies — track and neutral reconstruction and 
charged particle identification — by studying these effi- 
ciencies in several control samples in both data and sim- 
ulation. We correct the MC efficiencies to match those 
seen in the data, and we take the statistical precision of 
these studies as an estimate of the systematic uncertainty 
on absolute efficiencies. 

Since we normalize our signals to B — > D^*>l~T>i, we 
calculate systematic uncertainties on the relative effi- 
ciency, treating uncertainties on the signal and normal- 
ization modes as correlated. The degree of correlation, 
and therefore, the degree to which the uncertainty can- 
cels, is determined by the kinematics of the two samples. 
For most of the final state particles, the kinematic dis- 
tributions are very similar between signal and normal- 
ization modes and so the systematic uncertainty cancels 
almost entirely. For the charged leptons, however, the 
momentum spectra are very different between signal and 
normalization (see Fig. [2]), and so the associated system- 
atic uncertainty is larger. 

4- Hadronic Daughter Branching Fractions 

We reconstruct both signal and normalization modes 
using the same set of final states, so uncertainties due 
to the branching fractions of these states very nearly 
cancel. (The momentum spectra are slightly dif- 

ferent between signal and normalization modes, so this 
cancellation is not perfect.) We take the uncertainty 
on each of the reconstructed D*, D, K®, and tt° decay 
modes from [l6j and propagate each of these uncertain- 
ties through to the relative efficiency, using the relative 
abundance of each decay chain in the signal and normal- 
ization MC samples to determine the correlation and the 
degree of cancellation. 

5. Leptonic r Branching Fraction 

The r branching fraction B(t~ — > l~vgv T ) appears 
only in the denominator of Eq. [18] and therefore con- 
tributes a 0.2% systematic uncertainty on all modes [l6| 
without cancellation. 



26 



X. RESULTS 

Table [X] summarizes the results from two fits, one in 
which all four signal yields can vary independently and 
the second B~-B° constrained fit with R(D+) = R(D°) 
and R(D* + ) = R(D*°). We observe approximately 
67 B — > Dt~V t and 101 B — > D*t~V t signal events 
in this i? - -B -constrained fit, corresponding to signal 
branching-fraction ratios of R(D) = (41.6 ± 11.7 ± 5.2)% 
and R(D*) = (29.7 ± 5.6 ± 1.8)%, where the first er- 
ror is statistical and the second systematic. Normaliz- 
ing these to known B° branching fractions, 8 we obtain 
B(B -> Dt~V t ) = (0.86 ± 0.24 ± 0.11 ± 0.06)% and 
B(B -> D*t-V t ) = (1.62 ± 0.31 ± 0.10 ± 0.05)%, where 
the third error is from that on the normalization branch- 
ing fraction. 

Table [X] also gives the significances of the signal 
yi elds. Th e statistical significance is determined from 
^/2A(ln£), where A(ln£) is the change in log-likelihood 
between the nominal fit and the no-signal hypothesis. 
The total significances are determined by including the 
systematic uncertainties on the fit yields in quadrature 
with the statistical errors. In the B - -B°-constrained fit, 
the signal significances are 3.6a and 6.2er for R(D) and 
R(D*), respectively. 

The statistical correlation between R(D) and R(D*) is 
—0.51 in the i?~-S°-constrained fit. This correlation is 
due to the fact that most of the events at large 77i^ iss are 
either B — ► Dt~V t or B — > D*t~v t signal events, and 
increasing either of the two signal yields in the fit neces- 
sarily decreases the other. The systematic uncertanties 
have a correlation of —0.03 between R{D) and R(D*); 
most of the systematic uncertainties have large negative 
correlations for the same reason that the statistical un- 
certainty does, but the combinatorial background uncer- 
tainty affects both signal yields in a coherent manner and 
so contributes a large positive correlation. The sum of the 
two branching fractions, taking all correlations into ac- 
count, is B(B -> D^t-V t ) = (2.48±0.28±0.15±0.08)%. 

Figures [17] and [18] show the observed q 2 distributions 
in the four signal channels in the low and high rn^^ re- 
gions, respectively. The histograms in these figures are 
taken from MC samples of the various components, with 
each component scaled to match the yield in the B~-B°- 
constrained fit; since q 2 is not a fit variable, we cannot 
show a projection of a continuous PDF as was done in 
Figs. [TUi rnn As before, we observe good agreement be- 
tween the data and the expectation from simulation, in 
both the low and high m^ iss regions. Since the q 2 distri- 
bution is highly dependent on the form factor model, we 
note that the CLN model describes both normalization 



We use [T3 | to normalize the four individual branching fractions. 
For the B~ — B°-constrained measurement, we use our own aver- 
ages of_the values in 16]: B(B° -> D+£~V t ) = (2.07 ± 0.14)% 
and B(B° -» D*+£-v e ) = (5.46 ± 0.18)%. 




q 2 (GeV/c 2 ) 2 

FIG. 17: (Color online) q 2 distributions of events in the four 
final states D*°r, D°£~, D*+£~, and D+£~, shown in the 
normalization region, mn liss < 1 (GeV/c 2 ) 2 . The data are 
shown as points with error bars. The shaded histograms are 
taken from MC samples with normalizations from the fit to 
data. The components are shaded as in Fig. 1101 



and signal events within the available statistics. 

Table [Xll summarizes the results of several crosschecks, 
including splitting up the sample according to lepton fla- 
vor, lepton charge, and data-taking period. We have 
done these checks by performing "cut-and-count" anal- 
yses, both in the data and in simulated event samples. 
In all cases, the results in data are consistent with our 
expectations from simulation. The first row in this ta- 
ble shows the fraction of events with muon candidates 
in data and simulation, both for the full event sample 
and for the signal-sensitive region in m 2 niss . Electron 
identification is more efficient than muon ID, which is 
why the muon fraction in the final sample is less than 
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TABLE X: Results from fits to data: the signal yield (N s i g ), the yield of normalization B — > D^*'l~Vt events (iV norm ), the 
relative systematic error due to the fit yields [(AR/R)nt], the relative systematic error due to the efficiency ratios [(AR/ R) e ], 
the branching- fraction ratio (R), the absolute branching fraction (6), and the total and statistical signal significances (<7 t ot 
and (Tstat). The first two errors on R and B are statistical and systematic, respectively; the third error on B represents the 
uncertainty on the normalization mode. The last two rows show the results of the fit with the B~-B° constraint applied, where 
B is expressed for the B°. The statistical correlation between R(D) and R(D*) in this fit is — 0.5f . 



Mode 




N sie 


■^V norm 


(AR/R) m 
[%] 


(AR/R) e 

[%} 


R B CTtot 
[%] [%} (<7 rt .t) 


B~ -> 


D u r-V r 


35.6±f9.4 


347.9±23.1 


15.5 


1.6 


31.4±17.0±4.9 0.67±0.37±0.11±0.07 1.8 (1.8) 


B- -> 


D*°t~v t 


92.2±f9.6 


1629.9±63.6 


9.7 


1.5 


34.6± 7.3±3.4 2.25±0.48±0.22±0.17 5.3 (5.8) 


B° -> 


D + T~V T 


23.3±7.8 


150.2±13.3 


13.9 


1.8 


48.9±16.5±6.9 1.04±0.35±0.15±0.10 3.3 (3.6) 


B° 


D* + T~V T 


15.5±7.2 


482.3±25.5 


3.6 


1.4 


20. 7± 9.5±0.8 1.11±0.51±0.04±0.04 2.7 (2.7) 


B -> 


Dt~V t 


66.9±18.9 


497.8±26.4 


12.4 


1.4 


41.6±11.7±5.2 0.86±0.24±0.11±0.06 3.6 (4.0) 


B -> 


D*t~V t 


1Q1.4±19.1 


2111.5±68.1 


5.8 


1.3 


29. 7± 5.6±1.8 1.62±0.31±0.10±0.05 6.2 (6.5) 



TABLE XI: Crosscheck studies, splitting the data according 
to lepton flavor, lepton charge, and running period. The first 
row shows the fraction of events with muon candidates for 
both data and MC samples, for both the full event sample 
and for the signal-sensitive region m^ iss > 1 (GeV/c 2 ) 2 . The 
second row shows fractions of events with positively charged 
lepton candidates, and the third row shows the fractions of 
events recorded in Run 4. In all cases, the data are consistent 
with the simulation and with expectations. 



Sample 


Full s 


ample 


High 


m 2 ■ 

' ' ''miss 


/data (%) 


/mc (%) 


/data (%) 


/mc (%) 


A 4 


40.0 ±0.9 


40.9 ± 0.4 


30.7 ±2.3 


31.9 ± 1.0 


£+ 


50.2 ± 1.0 


49.2 ±0.4 


49.3 ± 2.5 


48.9 ± 1.0 


Run 4 


44.6 ± 1.0 


47.6 ±0.5 


46.8 ± 2.5 


48.5 ± 1.2 



50%, and, at lower momenta (which generally correspond 
to larger rnf lliss ) , this efficiency difference is more pro- 
nounced; in both cases, however, the muon abundance 
is well-modelled by the simulation. The next row shows 
the fraction of positively-charged lepton candidates (ver- 
sus negatively-charged candidates), and all samples are 
consistent with the expected 50/50 split. The last row 
shows the fraction of events recorded during the Run 4 
BABAR data-taking period; Run 4 had significantly dif- 
ferent accelerator background conditions from Runs 1-3, 
which could affect missing-energy analyses. The fraction 
of events in the Run 4 subsample is consistent with ex- 
pectations: Run 4 makes up 47% of the total luminosity. 

We estimate the goodness of fit using an ensemble of 
simulated experiments. We generate 1000 event samples, 
using the nominal PDFs for the fit to data and event 
yields based on the i? - -B -constrained fit to data. We 
fit each of these samples both with and without the B~- 
B° constraints and study the distribution of — log £ in 
these fits. 

Figure [19] shows the distribution of — log£ for the two 
ensembles of fits. In both cases, the value of — log£ ob- 
tained in the fit to data is indicated with an arrow, and, 
in both cases, this value is found within the central part 



of the Monte Carlo distribution, indicating a good fit. In 
the unconstrained fit, 11.7% of the simulated experiments 
have a value of — log £ greater than the value observed 
in data, corresponding to the probability that we expect 
to observe a fit as bad, or worse, than the one actually 
observed. This probability is large, indicating an accept- 
able goodness of fit. The corresponding probability for 
the B~-B° constrained fit is 11.8%, also large. 



XI. CONCLUSIONS 

We have presented measurements of the branching 
fractions for the decays B — ► Dt~V t and B — > D*t~V Ti 
determined relative to the corresponding decays to light 
lcptons. We measure the branching-fraction ratios for 
four individual £)W states, as well as two B~-B°- 
constrained ratios 



R(D°) = (31.4 ±17.0 ±4.9)% 

R{D*°) = (34.6 ±7.3 ±3.4)% 

R(D + ) = (48.9 ± 16.5 ±6.9)% 

R(D* + ) = (20.7 ±9.5 ±0.8)% 

R(D) = (41.6 ± 11.7 ±5.2)% 

R(D*) = (29.7 ± 5.6 ± 1.8)% , 

where the first uncertainty is statistical and the second 
is systematic. The significances of these signals are 1.8cr, 
5.3cr, 3.3cr, 2.7cr, 3.6cr, and 6.2er, respectively. The sta- 
tistical and systematic uncertainties on R(D) and R(D*) 
have correlations of —0.51 and —0.03, respectively. 

From these branching-fraction ratios and known 
branching fractions of the normalization modes B — > 
D^£~T>£, we derive the absolute branching fractions 
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FIG. 18: (Color online) q 2 distributions of events in the four 
final states D* £~, D°£~, D*+£~, and D+T, shown in the 
signal region, m^ iss > 1 (GeV/c 2 ) 2 . The data are shown as 
points with error bars. The shaded histograms are taken from 
MC samples with normalizations from the fit to data. The 
components are shaded as in Fig. 1101 



B(B~ - 
B{B- - 
B(B° - 
B{B° -» 
B(B 
B(B - 



D°t-V 



D*°t-V 
• D + t — 
D*+t- 
-> Dt 



= (0.67 ±0.37 ±0.11 ±0.07)% 

= (2.25 ±0.48 ±0.22 ±0.17)% 

v T ) = (1.04 ±0.35 ±0.15 ±0.10)% 

v T ) = (1.11 ±0.51 ±0.04 ±0.04)% 

V T ) = (0.86 ±0.24 ±0.11 ±0.06)% 

V T ) = (1.62 ±0.31 ±0.10 ±0.05)% 



where the third uncertainty reflects that of the normal- 
ization mode branching fraction. 

The measurement of B(B° — > D* + t~V t ) is consistent 
with the Belle result [TtJ ■ The branching-fraction ratios 




-17000 



-16000 



Vi 
15000 



-14000 



-log£ 



FIG. 19: Distribution of — \ogC from simulated experiments, 
showing (a) the unconstrained fit and (b) the B~-B° con- 
strained fit. The observed values of — log L in the fit to data 
are indicated with arrows. The fraction of experiments with 
— log C larger than the observed value is used to estimate the 
goodness of fit. 



R{D) and R{D*) are about la higher than the SM pre- 
dictions but, given the uncertainties, there is still room 
for a sizeable non-SM contribution. 

We have also presented distributions of the lepton mo- 
mentum |p|| and the squared momentum transfer q 2 for 
B — > D^*'t~V t events. In all cases, these distributions 
are consistent with expectations based on the SM and 
the CLN form factor model with measured form factors. 
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